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Abstract 

Tree-based protocols are ubiquitous in distributed systems. They are flexible, they perform 
generally well, and, in static conditions, their analysis is mostly simple. Under churn, however, 
node joins and failures can have complex global effects on the tree overlays, making analysis 
surprisingly subtle. To our knowledge, few prior analytic results for performance estimation of 
tree based protocols under churn are currently known. We study a simple Bellman-Ford-likc 
protocol which performs network size estimation over a tree-shaped overlay. A continuous time 
Markov model is constructed which allows key protocol characteristics to be estimated, including 
the expected number of nodes at a given (perceived) distance to the root and, for each such 
node, the expected (perceived) size of the subnetwork rooted at that node. We validate the 
model by simulation, using a range of network sizes, node degrees, and churn-to-protocol rates, 
with convincing results. 

1 Introduction 

Tree-based protocols are used for many purposes in distributed computing and networking, in- 
cluding routing, multicasting, distributed monitoring, fault management, and data aggregation. 
Protocols that construct and maintain tree overlays for these purposes are often simple, and they 
are easy to analyze as long as the underlying network (graph) is static. In the case of node churn, 
however, tree maintenance can cause highly complex behavior. In fact, at sufficiently high churn 
rates, the overlays will never stabilize to a proper spanning tree, and the underlying network may 
become disconnected. In the case of an aggregation protocol, these phenomena can cause significant 
errors when estimating the system size, for instance. In this paper, our objective is to estimate this 
error, and, more generally, to present an analytic model of the tree maintenance and aggregation 
processes using continuous-time Markov models. 



The Protocol. We study the problem in the context of a simple, tree-based aggregation protocol 
named GAP [TJ, which is an asynchronous adaptation and extension of the self-stabilizing BFS 
construction algorithm of Dolev et al [3]. GAP is self-stabilizing in the sense that, once no churn 



occurs, the network and aggregation data eventually become static, the overlay forms a stable BFS 
tree, and the root has the correct aggregate, in time linear in the diameter of the network graph. 
A tree based aggregation protocol allows computing aggregates of local variables, such as node 
counts, max/min, sums, averages, distributions, and distributed selection queries, such as top k 
or A;'th smallest values, with good efficiency (cf. [10J). In this paper we restrict the discussion to 
counting, and the protocol thus estimates the number of nodes, or the network size. 

The Model. We model the protocol behavior using the master-equation approach of statistical 
physics, cf. [23]. For tractability we assume a standard Poisson-arrival model with equal join and 
failure rates Xj and Xf, in order to achieve a stochastic equilibrium. Each node maintains a local 
aggregate, a level variable indicating its belief of the distance to the root, and a pointer to its 
current parent. At a Poissonian rate X g , a node performs a protocol cycle, which includes sampling 
the state of its neighbors and updating its own state variables accordingly. We construct a model 
to estimate the expected value of the local aggregate for a node at level x (x indicating the distance 
in hops of the node from the root). The model is built in two steps. We first estimate the expected 
values of the quantities N x , the number of nodes at level x, which we then use to estimate the 
quantities M x , the expected values of the local aggregate of nodes at level x. Given that Nq = 1 
by construction, Mq represents the estimate of the global aggregate at the root, or, in the case of 
our protocol, the estimate of the network size. 

The Evaluation. The model is evaluated through simulation, for network sizes ranging from 
10 3 to 10 5 and rates r = X g /Xt between 1 and 1,000. In the case of r = 1 the tree overlay is 
highly unstable, and the protocol behavior is dominated by the join subprotocol. For r = 1,000, 
tree- maintenance is performed at a much higher rate, and a node executes an average of 1,000 
protocol cycles during its lifetime. Clearly, the effect of churn in the system becomes more visible 
the lower the value of r becomes^. The simulations show a good fit of the measurements to the 
analytic model for the parameter ranges examined. 

Related Work. A number of authors have studied analytical models of churn processes and their 
effects on the stability and performance of p2p systems, cf. [51 [22], [H] [7]. In earlier work [9], we 
used a model of churn similar to the one used here, to analytically estimate the performance of the 
Chord DHT under churn. The protocol studied here is far simpler and so we are able to model it 
at a much finer level of granularity than [9] where we had to treat lookups as atomic. 

The effects of churn on network connectivity have been studied by Leonard et al [13] . They relate 
node lifetime distributions to the likelihood that nodes get isolated, and estimate the probability of 
p2p system partitioning. Kong et al [8] study reachability properties of DHT systems and obtain 
analytic models of connectedness in terms of system size and node failure probability. Wu et al [23] 
model different DHT lookup mechanisms and evaluate analytically their performance under churn. 
Several authors |21] IT2] use stochastic fluid models to evaluate the performance of p2p streaming 
systems such as Bit Torrent. 

We know of few prior analytical performance analyses of tree-based protocols under churn in the 
literature. In the context of multicasting, van Mieghem [15] investigates the stability of multicast 
shortest-path trees by estimating the expected number of link changes in the tree after a multicast 
group member leaves the group. The analysis is done for random graphs and fc-ary trees. Lower 

Although, perhaps surprisingly, the effect of disconnection on level assignments is smaller. 
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bound results have been obtained for Demmer and Herlihys Arrow protocol [2j[IT], and the average 
case communication cost of Arrow is studied in [19], but only for the static case. 

Other authors have used analytic methods to model the effects of time-varying aggregation 
data, cf. pOj El [U [18], but again only for the case of static networks. 

Structure of Paper. Section [2] describes our network model and Section [3] an outline of the 
GAP aggregation protocol. In Sections U] and [5] we construct a stochastic model for the number 
N x of nodes with a certain distance x to the root in a system with a given rate of churn, and 
an estimated average partial aggregate A x held by a node at distance x. Section [6] presents the 
simulation results and we end by discussing the implications of our results. Further details of the 
calculations are relegated to the appendix. 

2 Network Model 

We model the network as an Erdos-Renyi random graph G(t) = (V(t), E(t)) with Poisson-arrival 
node join and (per node) failure rates Xj and Xf as described above. Nodes join with an a priori 

_TT* — 

Poissonian degree distribution Q(k) = e 4r, where A is the average degree of the new joinees, 
and attach themselves to nodes that are already present in G(t) with uniform probability. Node 
failures/deletions remove a node v chosen uniformly at random from V(t) along with all edges 
attached to v. We state the following observation without proof (cf. |17J). 

Proposition 2.1 Assume a network following the above conventions, and let Xj = NXf. 

1. The a posteriori degree distribution Pk(t) of G(t) is time independent and identical to Q(k). 

2. In expectation, the network size (number of nodes) of G(t) is N. 

3 The Protocol 

The protocol we analyze is based on the BFS algorithm of Dolev, Israeli, and Moran [3]. The graph 
has a distinguished root node vq which never fails, and which executes a program slightly different 
from the others. 

A node v G V(t) has three associated registers lvl v , agg v and par v holding the following values: 

• lvl v £ oo holds v.s level, i.e. v.s current belief in the number of hops needed to reach vq. 

• agg v € oj holds v.s partial aggregate: In this case v.s current belief in the size of the subtree 
rooted in v. 

• par v £ \J{v' € V(t') \ t' < t} indicates the node (which may no longer exist) which v believes 
to be its parent in the global aggregation tree. 

A configuration is a graph G(t) along with assignments to the three registers for each node in V(t). 
We write lvl v (t) when we want to refer to the value of lvl v at time t and similarly for the other 
registers. 

Each node randomly and independently executes updates, which are Poisson distributed events 
with rates X g . Let n v (t) be the neighbours of v at time t. When one node v executes an update at 
time t its registers are updated as follows: 
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• lvl v := min{lvl v i \ v' E n v (t)} + 1, where min(0) = oo and oo + 1 = oo. 

• agg v := Y,{agg v , \ v' € n v (t) A par v , =v} + l. 

• par v is some v' £ n v (t) with minimum level. If n v (t) = then par v = v. 

Upon joining, a node v updates its level and parent registers as above, and initializes its ag- 
gregate register to 1. We say that a node v joins at level x £ uj, or, joins at defined level, if, upon 
completed initialization, lvl v = x ^ oo. 

The root node updates only its aggregate register, as above, and initializes lvl vo to and par VQ 
to vo. 

For a fixed expected graph size N, the initial configurations at time t = are obtained from a 
graph (vq, 0) by nodes joining the graph according to the network model and join protocol above, 
until the graph has reached size N. 

It is not hard to see that in the static case, this algorithm converges in expected time A" 1 
times the diameter of the network. In fact, the algorithm is self-stabilizing [3J in the sense that 
convergence is independent of how the registers are initialized. 



4 The N x Model 

To estimate the long term average population at level x we use the following stochastic variables: 

• N x (t) estimates the number of nodes with level assignment x £ u) at time t. The level 
assignment may be correct in the sense that the node has a neighbour with minimum level 
assignment x—1 (if it is not the root), or it may be incorrect, in which case the level assignment 
will change upon an update. 

• N^ s (t) estimates the number of nodes at level x with incorrect level assignment. We call 
these nodes unstable. Clearly this is a subset of N x (t) 

• Nx(t) estimates the number of nodes with level x and with minimum neighbour level y. Upon 
performing an update, this node moves to level y + 1. 

Corollary 4.1 

N (t) = l, N x+1 (t) = N£ +1 (t) + N£. 1 (t), N$ s (t) = 0, 

In this analysis, we are mainly interested in expectations (time averages) in the steady state. So 
in most of what follows, we discard the time parameter and, for ease of notation, denote E[N X ] 
simply by N x . 

From the join protocol and in the large N limit, the probability of a node joining level x is 
denoted p m in{x){t): 

Pmi n(x ~ l)(t) = £ Q(k)[(l - g ^ - (1 - E ^) k ] (1) 
k i=0 i=0 
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This is the probability that at least one of the nodes' k links is connected to a node at level x 
while no link has a connection to a lower level. This quantity is averaged over k. 
The gain- loss terms for N x (t) are determined as follows: 



N x (t + At) 



N x (t) + 1 w.p. XjPminix - l)(t) 

N x (t) - 1 w.p. X f N x (t) 

N x (t) + 1 w.p. X g £ N*r\t) x' + x KL) 

N x (t) - 1 w.p. X g N™{t) 



The first (gain) term quantifies the change when a node joins and the second (loss) term accounts 
for node failures. The third term is the probability that a node with an incorrect level assignment 
actually has its lowest connection at level x — 1 and so is liable to join level x upon an update. This 
constitutes an influx into level x due to node updates. The last term quantifies the probability 
that a node with incorrect level x updates its level and is removed from level x. This term thus 
estimates the outflux from level x due to node updates. 

The above gain- loss terms translate to a time-dependent equation for N x , but in all that follows 
we are only interested in the steady state. That such a steady state exists (and is unique) for this 
process is guaranteed by the properties of continuous time Markov processes [16J. 

In steady state, the above gain-loss terms make the following prediction for N x 

JV, / NT 1 N us \ 

-jj- = Prrnnix - 1) + r f f- j (3) 

The term inside the brackets is the difference of the influx of nodes after an update event into level 
x, from an outflux of unstable nodes out of x. On general grounds, this term may be expanded in 
a series ^yk/ rk - The first term in the sum which is of order 1/r, when multiplied by r will be 
independent of r and hence to leading order, we expect N x to be independent of r. However if the 
term inside the brackets is non-zero, we would have sub-leading order corrections, and hence an 
r-dependence especially for low r. We make the important simplifying assumption that this does 
not happen: 

Assumption 4.2 In expectation, the influx of nodes into level x, Ylx'^x ^ x ~ l , is equal to the 
outflux of nodes from level x, N x s . 

Below we discuss why and under what conditions assumption 14.21 might be valid, but, if the as- 
sumption is valid then, from Eq. [3l we have N x /N = p m in{x — 1). Thus if influx and outflux are in 
equilibrium, the level distribution is independent of r and equal to the a priori distribution. Indeed 
this is confirmed by the simulation results, Figure [TJ with caveats related to the possibility of nodes 
disconnecting from the root. 

The main reason why assumption 14.21 might fail to hold is disconnection. The probability of 
disconnection decays exponentially in the degree A. For this reason we conjecture that assumption 
14.21 holds with probability overwhelming in A and N. 

Disconnection may happen for several reasons. Most interesting is the count-to- infinity problem 
where, upon disconnecting, a cluster of nodes' level registers begin to diverge. In this case we do not 
expect influx and outflux to be in equilibrium, as the lowest levels in that case have only outflux, 
and the highest only influx. In the absence of disconnection, however, we expect join and failures 
at any level x' < x to have opposing, but equal effects on the (eventual) instability at level x. If 
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this were not the case, there would be a continuous accretion or depletion of nodes from level x, 
something which would contradict both intuition and experimental evidence. 

In what follows, we make use of assumption 14.21 to estimate both the size of the perceived 
subnetwork rooted at a node at level x, as well as the number of unstable nodes at level x. 



5 Aggregation 

We now turn to the stochastic variables M x (t), which is the aggregate held by a node at level 
x. A x (t) = M x (t)N x {t) estimates the total aggregate held at level x. We can write the following 
gain-loss terms for this quantity 



A x (t + At) -» < 



A x (t) — M x (t) 
A x (t) + 1 
A x {t) - M x (t) 
A x (t) + M x ,{t) 

A x (t) - M x (t) + 1 + 



N x 



w.p. 


XfN x 


w.p. 


XjPminipC 1) 


w.p. 




w.p. 




M x+ i(t) w.p. 


\ g N° 



We account for all the processes, which in an infinitesimal unit of time, could change A x : Level 
x loses Mj in expectation upon failure of a node at level x; upon a join, A x is increased by one; 
upon an update of an unstable node leaving level x, M x is lost (in expectation); upon an update 
of an unstable node at level x' entering x, M x i is gained; finally, upon an update of a stable node, 
the current expected aggregate is lost, and the sum of the aggregates of the children plus one is 
gained. For the last term we estimate the expected number of children of a node at level x by the 
quantity d x = N^. +1 /N x where N£ = N x — iV" s represents the number of stable nodes at level x. 
This is a good estimate but not exact, as there might be nodes at level x + 1 which have a better 
choice of parent but do not as yet know this (not having executed an update). In this case, from 
the protocol, they would still send their aggregate to their outdated parent. However we ignore 
this effect in the theory. 

Using a x = A x /N = M X N X /N the steady state equation is : 



ajl + r) 



x'^x 



+ + ra x+l 



N S X N° X+1 



N 



N 



N x K 



x+l 



Further simplification and assumption 14.21 produces the following recursion relation: 



Pmin{x 1 ) ~\~ 



r + 1 



(a x 



N*N, 



+i 



2 + 1 



N x N, 



x+l 



+ ( M -' 

x'^x 



NT 1 
; N ' 



(4) 



(5) 



To solve ©, we need to estimate N£ (or equivalently, N^ s ), which is a function of r. We present 
this calculation in the appendix. The calculation involves estimating the outflux out of level x 
due to all the processes which might make a node at level x unstable: namely, the node has a 
single connection to the earlier level which fails; or the single connection is itself unstable and on 
an update makes the level-x node unstable; or a new joinee or one of the neighbors of the node 
provides a better choice of parent. We show from comparison to simulations, that our estimate is 
quite good for all the values of the parameters considered. 

To solve Eq. ([5]), we also need to estimate the last term which involves terms N^r 1 - To this 
end we make the simplifying assumption (consistent with assumption I4.2p . that for x' ^ x, N*~ l , 
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the influx to x from x', is roughly equal to the population at level x' times the ratio of outflux of 
level x to the entire population, i.e. 

ATUS _ 

xV x => N x x r l « -^N x , E (6) 

Note that this assumption can itself be used to estimate N x s and gives a far worse estimate than 
the one we derive in the appendix. However this assumption helps in making the last term in Eq. 
([5]) tractable. 

Using © Eq. ([5]) can then be simplified to 

i -I \ / x x+l iv x / / / -.w /— \ 

a x = Pmin{X — 1) H — [a x+1 — — h -jrr- > [a x > - p m in{X - 1)) (7) 

r + 1 N x N x+l N 

x'^0,l,x 

This equation can be solved numerically, once we have an expression for iV" s . Since the L.H.S has 
a sum over a x i , which is the quantity we want to compute, we solve the equation iteratively, to get 
a self-consistent solution. 



6 Validation Through Simulation 

We evaluate the model developed in Sections H] and [5] through simulation studies using a discrete- 
event simulator. We simulate the protocol described in Section [3] executing on a network modeled 
as described in Section (2) The events we simulate are nodes joining, nodes failing and the execution 
of protocol cycles on a node. (The protocol does not distinguish between a node failing and a node 
leaving the network.) 

When a join event occurs, a new node is created on the network graph, links from this node to 
other nodes of the network graph are created, the node's registers are initialized following the join 
protocol described in Section El and the node executes a protocol cycle. When a fail event occurs, 
the node is removed from the network graph, together with the links that connect it to other nodes 
in the graph. Finally, during the execution of a protocol cycle on a node, the registers of the node 
are updated as described in Section [31 whereby the node changes its parent only if a neighbor with 
a lower level than its current parent exists. 

During a simulation run, we periodically sample the metrics N, N x , A x and N x s . Each mea- 
surement point on the graphs in this section corresponds to values averaged over at least a 1,000 
such samples. 

All runs simulate the system in steady-state. The network graph for the simulation is a random 
graph with N nodes with a probability p that any pair of these nodes is connected, (p = X/N 
where A is the average node degree.) 

We perform simulation runs for all combinations of the following parameters: number of nodes 
(N): 1,000, 10,000, 100,000; average node degree (pN): 4, 6, 8; r: 1, 10, 100, 1,000. Runs for a 
single simulation can be very long for large networks and small r. For instance, for iV = 100, 000 
and r = 1, the simulation takes about a month on a single server to collect 1,000 samples. 

Using these simulation results, we validate the following predictions of our models, for steady 
state conditions: the level distribution, the accuracy of the counting protocol and the distribution 
of unstable nodes. The first two predictions are discussed in this section, while the last is discussed 
in the appendix. 

2 In fact the use of N here is a slight overestimation, as levels 0,1, and x itself need to be excluded. 
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Validation of Level Distribution (N x /N). In this series of experiments, we validate the N x 
model (see Section Hj) under steady-state conditions. 

Specifically, we compare the simulation results with the prediction of equation ([3|) . 

Figure [T] shows in a series of graphs, the numerical evaluation of equation ([3|), together with 
simulation results, for network sizes of 1,000 and 100,000, degrees of 4 and 8, and rates of 1, 10, 
1,000. 

We make the following observations. First, the model predictions fit the actual simulation 
results very well. As predicted, the distribution seems to be independent of the parameter r. 
Second, as expected, an increase of the network size, or a decrease of the node degree enlarges the 
average tree height. 



N=1,000 pN=4 N=1,000k=8 




(c) (d) 

Figure 1: Comparing theory with simulation results: the level distribution N x /N as a function of 
network size, node degree pN and r. 

Validation of A x /N . In this series of experiments, we validate the A x model (see Section [5]) 
under steady-state conditions. 

Specifically, we compare the simulation results with the prediction of equation ([7|) . 

Figure [2] shows, in a series of graphs, the numerical evaluation of equation (J7|), together with 
selected simulation results, for network sizes of 1,000, 10,000, 100,000, degrees of 4, 6, 8, and rates 
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Table 1: Relative errors of model prediction of Aq/N versus simulation results in steady state, for 
network size N, average node degree pN and r = X g /Xt. 

of 1, 10, 100, 1,000. 

These results suggest the following. First, the accuracy of the prediction of the model increases 
with r and N, and decreases with x. Second, we observe that the accuracy of the counting protocol 
decreases with increasing network size and, as expected, the accuracy increases with the degree of 
the network graph and the parameter r. 

Of specific interest to us is the accuracy of the counting protocol, which depends on the predic- 
tion accuracy of Aq/N. Table [1] gives the relative errors of the model prediction of Aq/N against 
the simulation measurements for various parameters. We notice that the predictions of the model 
are good for large values of r. For instance, for r = 100, i.e., for systems with an average lifetime 
of about 100 protocol cycles per node, the prediction error is at most 2%, for network graphs with 
average node degree of 6 or 8. We can also see that for a highly dynamic system with r = 1, 
the error is much larger. To illustrate the system dynamics, in the scenario with r = 1, pN = 8, 
and iV = 100,000, the model predicts Aq = 201, while the simulation gives Aq = 122, which is 
hopelessly wrong for a counting protocol. 

7 Discussion 

The contribution of this paper is an important first step towards a general theory of tree-based 
protocols under churn. We have shown that key performance metrics of a simple, tree-based ag- 
gregation protocol can be calculated systematically, and that these calculations are mostly in good 
agreement with simulation results. In addition, the equations we have derived can be solved numer- 
ically for any set of parameter values, to predict the expected accuracy of the estimate at the root, 
which requires only a few seconds on a regular laptop. (This is in contrast to a simulation, which, 
depending on the parameter values of interest, can take months on a single processor.) We believe 
that using such analytical tools to predict performance metrics can be of invaluable assistance 
in understanding the behavior of large-scale distributed information systems, and hence open up 
exciting avenues for engineering new protocols that execute effectively in dynamic environments. 

As the reader surely has noticed, the analysis presented in this paper, is not rigorous. Simplifying 
assumptions are made and theorems or exact bounds are not proved. However, this method follows 
a well-established practice in physics, namely that of deriving equations for expected values and, 
in the case they are not solvable, simplifying these while ensuring that global invariants (such as 
proposition 12.11 or assumption 14.21 ) are not violated. The simplifications of such equations usually 
fall into three categories. The first involves simplifications that affect the very form of the equation, 
by disregarding fluctuations that could lead to terms with higher moments. Equations for these 
higher moments would then possibly involve still higher moments, leading in most cases to an 
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Figure 2: Comparing theory with simulation results: A x /N as a function of network size, node 
degree and r. For x = 0, the curves show the accuracy of of the protocol for counting. 



unsolvable hierarchy of equations. Such an approximation is made, for instance, in Equation [5l in 
assuming that E\d x M x+ \] = E[d x ]E[M x+ i\. The second simplification appears in assumptions like 
those made in the estimation of d x or N^,~ • This type of simplification is a less serious modification 
of the theory than the first one, since a better estimate of these terms is all that is required for 
more accurate predictions. The third simplification involves using expressions, such as Equation. 
1, which strictly hold only in the large system limit. As we see from Figure [2] our fits do get better 
as TV increases. Proving this rigorously, however, is a very hard task and not addressed here. 
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8 Appendix 

8.1 Estimation of N^ s 

We present below the details of the calculation for N" s . This calculation goes through the usual 
steps of detailing gain/loss terms, finding appropriate expressions for these and solving the equation 
for the expectation value in the steady state. As in the earlier estimates, we use both proposition 
12.11 and assumption 14.21 Specifically, we use assumption 14.21 to justify our use of the a-priori 
distribution in determining quantities such as the fraction of nodes at level x, having only one 
connection to the level above. Note that we cannot (and do not) use the a-priori distribution 
to determine the average number of connections that a node has to the level below. This is a 
quantity which is set by the parameter r. But we are consistent with both proposition 12.11 and 
assumption 14.21 in assuming that the average number of connections to the level above (as well as 
the average number of connections within the same level) are independent of r, and given by the 
a-priori distribution. 

The x > 2 case. Since the root never fails, nodes at level 1 can never become unstable. We thus 
need to estimate unstable nodes only for level x > 2 We obtain the following gain-loss terms for 
N™: 
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The probabilities (A)-(C) are determined as follows: 

(A) This is the probability that a single connection at level x — 1 fails or becomes unstable. If 
the connection fails, by definition the node at level x automatically becomes unstable. If 
the connection becomes unstable, the level- x node becomes unstable only after the parent 
has perfomed an update, which explains why the second term in the following equation is 
multiplied by r. Introducing the notation A^(l) and N x (l) for the number of stable nodes at 
level x with exactly one parent and the number of nodes at level x with exactly one parent, 
respectively, (A) is then N x (l)(l + rN^i 1 /N x ^i)/N. We can write this as: 
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Using the a priori distribution as mentioned earlier, we take 
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(B) This is the probability that a new joinee with better level connects to a stable node at level 
x. If such an event occurs, then the node at x has a better choice of parent then the current 
one at x — 1, and hence becomes unstable. Using the a-priori distribution, We take (B) as 

r 2 x-3 
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This is the average number of connections to stable nodes at level x that a new joinee with 
a parent at level 0,1 ... up to x — 3, has. Such a new joinee will then make the node at level 
x change its level, when it updates. 

(C) The probability that a neighbor of a node at level x provides a better choice of parent, i.e. 
that at least one of the neighbor nodes is unstable with a better parent at level strictly less 
than x — 2. When such a neighboring node performs an update,this makes the node at level 
x unstable. 

We first estimate X x , the average degree of a node at level x using the a priori distribution. 
In order to not overcount the case of a single parent (case (A)), we consider all neighbours of 
a node with more than one connection to the parent level and all neighbours excluding the 
parent, for a node with only one connection to the parent level . X x then takes the value 
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Secondly, we estimate the probability of a neigbouring node being unstable with a parent at 
level 1, • • ■ , x — 3. We bound this probability by the sum ^2 i=2 x -2 This approximation 



x-2 



is the same as is made in Eq. [6j As a consequence we get (C) = r\ x J2i=2 ~ 
Putting the above together we obtain: 

N? s D + E + F 



where 



N x 1 + D + E 

D = — 5_(i + r _5_i) 

E = [A 2 e -Viv + X( e A/JV_ e -AEr = - 2 



(11) 



1 + r 



f = ij.vE 

1+r ^ N 

i=2 

Figure [3] shows in a series of graphs, the numerical evaluation of equation [3l together with 
simulation results, for network sizes network sizes of 1,000 and 100,000, degrees of 4 and 8, and 
rates of 1, 10 and 1,000. 

We see that Eq. [3] predicts simulation results quite well, including the non-monotonic trend in 
N x s /N x as x increases as well as the decrease in this quantity with increasing r. 
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Figure 3: Comparing theory with simulation results: the fraction of unstable nodes at level 
N£ s /N x as a function of network size, node degree and r. 
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